Hybrid Methods for Continuous State Dynamic Programming
نویسندگان
چکیده
We propose a method for solving continuous-state and action stochastic dynamic programs that is a hybrid between the continuous space projection methods introduced by Judd and the discrete space methods introduced by Bellman. Our hybrid approach yields a smooth representation of the value function while preserving the computational simplicity of discrete dynamic programming. The method is especially well suited for implementation in a vector processing environment such as MATLAB or GAUSS, and makes it possible to automate the setup and solution of continuous space dynamic programs in a way that previously seemed elusive. Dynamic programming, while familiar and often the natural way to handle many decision problems, has not seen widespread application. In part this is due to the inherent limitations due to the curse of dimensionality: problems with many state variables become too large to solve. Another reason, however, is surely due to the diiculties one faces in formulating models and solving them. The practice of dynamic programming seems to be the domain of specialists who program each problem anew. This need not be the case. Discrete time dynamic programming problems, for which the state and policy variables can take on a discrete nite set of values, can be completely characterized by a discount factor and two matrices, one describing the current reward function and the other describing the state transition function. Solution methods in this case require maximization over the elements in each column of a matrix and simple matrix arithmetic (addition, multiplication and a linear solve operation). We propose here a similar method for problems with continuous state and action variables that is a hybrid between projection and discretization methods and that retains the best features of both. With this approach one can characterize a DP problem with a discount factor and three matrices, with the addition, for stochastic 1
منابع مشابه
Stochastic Dynamic Programming with Markov Chains for Optimal Sustainable Control of the Forest Sector with Continuous Cover Forestry
We present a stochastic dynamic programming approach with Markov chains for optimal control of the forest sector. The forest is managed via continuous cover forestry and the complete system is sustainable. Forest industry production, logistic solutions and harvest levels are optimized based on the sequentially revealed states of the markets. Adaptive full system optimization is necessary for co...
متن کاملImproving the Energy Management of Parallel Hybrid Electric Vehicle by Dynamic Programming Using Electro-Thermal Model of Battery
In this paper, an offline energy management system (EMS) is proposed for parallel hybrid electric vehicles (HEVs). The proper energy management system is necessary for dividing torque between electrical motor and Internal Combustion Engine (ICE). The battery is a crucial component of hybrid electric vehicles and affects significantly the cost and the performance of the whole vehicle. The primar...
متن کاملApproximate Dynamic Programming Recurrence Relations for a Hybrid Optimal Control Problem
This paper presents a hybrid approximate dynamic programming (ADP) method for a hybrid dynamic system (HDS) optimal control problem, that occurs in many complex unmanned systems which are implemented via a hybrid architecture, regarding robot modes or the complex environment. The HDS considered in this paper is characterized by a well-known three-layer hybrid framework, which includes a discret...
متن کاملA Hybrid Dynamic Programming for Inventory Routing Problem in Collaborative Reverse Supply Chains
Inventory routing problems arise as simultaneous decisions in inventory and routing optimization. In the present study, vendor managed inventory is proposed as a collaborative model for reverse supply chains and the optimization problem is modeled in terms of an inventory routing problem. The studied reverse supply chains include several return generators and recovery centers and one collection...
متن کاملConvex dynamic programming for hybrid systems
A classical linear programming approach to optimization of flow or transportation in a discrete graph is extended to hybrid systems. The problem is finite dimensional if the state space is discrete and finite, but becomes infinite dimensional for a continuous or hybrid state space. It is shown how strict lower bounds on the optimal loss function can be computed by gridding the continuous state ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007